DESY 95-069 
hep-ph/9504297 

On particle-like jets

Hermann Heßling
Deutsches Elektronen-Synchrotron (DESY)
Notkestraße 85, D-22603 Hamburg, Germany

February 1, 2008 



Abstract 

Under which conditions does a jet appear as a particle-like signal from the
hidden realm of quarks and gluons? Motivated by this question, jet clustering
conditions are formulated in order to characterize jet clustering algorithms
that can be used to determine particle-like jets. Jets are understood as
particle-like if they behave like free particles. The simplest solution to the
jet clustering conditions leads to a new jet algorithm: a Lorentz invariant
generalization of the JADE algorithm. It is found that this generalization
amplifies hadronization effects in certain phase space regions in such a way
that hadronization models might become testable in jet physics at the
electron-proton collider HERA. Moreover, a method is suggested that can be
used at HERA to determine a region in phase space where hadronization effects
from the proton remnant are small and where parton jets are particle-like.

1 Introduction 



Quarks and gluons do not exist: they cannot be investigated experimentally
as free particles. While the mass of a free particle is an observable, the mass
of a quark is not well definable. Only gauge invariant collections of quarks
and gluons are physically meaningful. Because of this confinement property it
is difficult to obtain observable predictions from QCD. Various strategies were
developed to obtain approximate predictions. A perturbation theory at short
distances can be formulated, since QCD is asymptotically free. It provides
deep insights into the dynamical structures of QCD. But the hadronization of
quarks and gluons cannot be explained within the perturbative approach. For
a description of hadronization, phenomenological models have to be used.

In high-energy collision experiments hadron jets, i.e. spatially isolated
clusters of hadrons, are observed. An interesting question is: under which
conditions does the direction of hadron jets agree with the direction of
partons from perturbative QCD? An attempt to answer this question has to take
into account that partons are colored and hadrons are colorless.

Monte Carlo event generators that connect perturbative QCD and hadronization
models, like LEPTO¹ [1], were developed. They can be used to study the
influence of hadronization. Parton jets and hadron jets are determined event by
event by applying a jet algorithm to the outgoing partons of perturbative QCD
and the hadrons from the phenomenological hadronization model, respectively.
The differences between parton jets and hadron jets are caused by hadronization
and are called hadronization effects. Hadronization effects depend strongly on
the jet algorithm.

A jet algorithm is a tool to analyze the hadronic final state and to provide
a basis for testing the underlying theory. There are many jet algorithms. One
reason to create a jet algorithm is to make the calculation of special aspects
of the theory possible, see e.g. the Geneva algorithm and the $k_T$-algorithm.
Jet algorithms motivated by QCD calculations allow a detailed view into
certain regions of the phase space, but they might not be equally suited to
different regions. For example, in the deep inelastic scattering (DIS) region
at the electron-proton collider HERA it is expected that the $k_T$-algorithm
describes jets the better, the more they are separated from the proton remnant,
i.e. the more the proton remnant "factorizes" from other jets. In section 2 a
new jet algorithm is derived, not from explicit QCD calculations, but from
general physical considerations. This jet algorithm is a Lorentz invariant
generalization of the JADE algorithm. It can be applied to all areas of jet
physics, as long as the jet masses are small. Using the DJANGO Monte Carlo²,
hadronization effects are compared in section 3 for the JADE algorithm and the
new jet algorithm. A variable is introduced which assigns a Lorentz invariant
distance between the jets and the incoming proton. It is shown that jet masses
strongly influence the hadronization effects if the jets are close (with
respect to this variable) to the incoming proton. The new jet algorithm seems
to be a sensitive tool to test hadronization models in jet physics.



¹ LEPTO links leading order QCD matrix elements (ME) of order $\alpha_s$,
parton showers (PS) and the LUND string fragmentation model.

² DJANGO is an interface between HERACLES, a Monte Carlo for deep inelastic
lepton-proton scattering including electroweak radiative corrections, and LEPTO.

2 Formulation of jet clustering conditions

We are interested in a description of jets which behave like free particles and
which we call particle-like jets. In this section we formulate four conditions
for jet clustering algorithms. They lead to a new jet algorithm, which can be
used to determine particle-like jets.

Clustering algorithms have been developed in the literature. We concentrate
on clustering algorithms of the following kind. Consider a collision event
consisting of $N$ hadrons with momenta $p_1, \dots, p_N$ and assign a distance
$d_{ij}$, $1 \le i < j \le N$, to all pairs of hadrons.³ By a renumbering of the
hadrons one can always arrange that $d_{1N}$ is the smallest of all distances.
If $d_{1N}$ is smaller than some reference distance,

    $$d_{1N}^2 < y_{\mathrm{cut}} M^2, \qquad (1)$$

recombine hadron $N$ with hadron 1 and assign a 4-tuple to the recombined
hadrons. We call the 4-tuple pseudo-momentum and denote it by the symbol
$p_1 \oplus p_N$. (In general it is not justified to consider $p_1 \oplus p_N$
as a momentum.) $M$ is called reference mass and $y_{\mathrm{cut}}$ is called
jet resolution parameter. This clustering procedure is repeated until none of
the distances fulfils the inequality (1). The remaining hadron clusters define
the jet content of the event. These jets are called hadron jets.
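The procedure just described can be sketched in a few lines of Python. This is only a minimal illustration: the function `cluster` and its arguments are hypothetical names, momenta are assumed to be tuples `(E, px, py, pz)`, and the concrete distance (pair invariant mass), recombination (4-vector sum) and reference mass ($W^2$ of the event) used in the toy example are illustrative assumptions, not choices made at this point of the text.

```python
from itertools import combinations

def cluster(momenta, dist2, recombine, ref_mass2, y_cut):
    """Pairwise clustering loop; all callables are supplied by the caller."""
    jets = list(momenta)
    while len(jets) > 1:
        # Find the pair with the smallest squared distance.
        i, j = min(combinations(range(len(jets)), 2),
                   key=lambda ij: dist2(jets[ij[0]], jets[ij[1]]))
        # Stop once the closest pair no longer fulfils inequality (1).
        if not dist2(jets[i], jets[j]) < y_cut * ref_mass2(jets):
            break
        merged = recombine(jets[i], jets[j])  # the pseudo-momentum
        jets = [p for k, p in enumerate(jets) if k not in (i, j)] + [merged]
    return jets

# Illustrative choices (assumptions), for 4-vectors (E, px, py, pz):
def add(p, q):
    return tuple(a + b for a, b in zip(p, q))

def inv_mass2(p):
    return p[0]**2 - p[1]**2 - p[2]**2 - p[3]**2

def pair_mass2(p, q):
    return inv_mass2(add(p, q))

def total_mass2(jets):
    total = jets[0]
    for p in jets[1:]:
        total = add(total, p)
    return inv_mass2(total)

# Toy event: three massless "hadrons" (made-up numbers).
hadrons = [(10.0, 0.0, 0.0, 10.0), (5.0, 3.0, 0.0, 4.0), (15.0, 0.0, 0.0, -15.0)]
jets = cluster(hadrons, pair_mass2, add, total_mass2, y_cut=0.2)
print(len(jets), "jets")
```

Passing the distance, the recombination prescription and the reference mass as callables mirrors the statement that clustering algorithms of this kind differ only in these three ingredients.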

In perturbative QCD jet algorithms can be applied to partons in order to
obtain results which are free of infrared and collinear singularities (see e.g.
[15]). The resulting jets are called parton jets.

Commonly used clustering algorithms differ by the reference mass $M$, the
distance $d_{ij}$, and the recombination prescription $p_i \oplus p_j$.

Hadrons, produced in a high-energy particle collision, propagate almost in- 
dependently. Consequently, the forces between hadron jets are small. Parton 
jets, on the other hand, are influenced by hadronization effects because of the 
color forces between the partons. If the color forces between parton jets are 
small, parton jets behave like free particles and are characterizable by kinemat- 
ical quantities. 

We assume that a momentum is sufficient to describe a particle-like jet
and that internal degrees of freedom, like spin, can be neglected. An event
consisting of $n$ jets, where each jet is particle-like, is called a
particle-like $n$-jet. Energy-momentum conservation suggests identifying the
total momentum of a particle-like $n$-jet with the total momentum of its
hadrons or partons, respectively. What can be said about the momentum of an
individual jet within a particle-like $n$-jet event? Consider an event
consisting of 2 hadrons with the momenta $p_1$ and $p_2$. If the 2 hadrons are
recombined, the only way to guarantee momentum conservation is to assign the
momentum $p_1 + p_2$ to the 1-jet. By induction one is led to the

A) Recombination condition: The pseudo-momentum has to be identified with
the sum of the momenta to be recombined,

    $$p_i \oplus p_j = p_i + p_j. \qquad (2)$$

³ The jet clustering algorithm ARCLUS assigns a distance $d_{ijk}$ to all
triples of hadrons.



In the current calculations of perturbative QCD parton jets are massless
(see e.g. [15] and references therein). Recombination prescriptions were
proposed in the literature which guarantee that the masses of the hadron jets
are also zero, e.g. $p_i \oplus p_j = (|\vec{p}_i + \vec{p}_j|, \vec{p}_i + \vec{p}_j)$.
These prescriptions are arbitrary, because there are uncountably many ways to
map the hadron momenta into massless hadron jets. This arbitrariness is
reduced, but not removed completely, if one restricts oneself to recombination
prescriptions which lead to small hadronization effects. One is still left with
the problem that the recombination prescription is not universal, since it
depends on a test variable which is used as a measure for the strength of the
hadronization effects. Moreover, if the pseudo-momentum is not simply the sum
of the two recombined momenta, $p_i \oplus p_j \neq p_i + p_j$, the physical
meaning of the pseudo-momentum is not clear and in particular one cannot expect
that the direction of a hadron jet can be identified with the direction of a
parton jet.

Another way to treat the masses of jets is to compare data with a more
complex evaluation of QCD matrix elements. This, of course, requires the
calculation of higher than leading-order corrections.

The reference distance on the right hand side of (1) is defined as the product
of the reference mass squared and the jet resolution parameter
$y_{\mathrm{cut}}$. If the jet resolution parameter $y_{\mathrm{cut}}$ is chosen
sufficiently large, the right hand side of (1) can be made larger than every
distance squared $d_{ij}^2$ and, consequently, all hadron momenta
$p_1, \dots, p_N$ are finally recombined into a 1-jet with the momentum
$p_1 + \dots + p_N$. In the hadronic center of mass frame a 1-jet is at rest
and its energy is the mass of the hadronic final state. Let
$y_{\mathrm{cut,triv}}$ be the smallest $y_{\mathrm{cut}}$ value at which the
hadrons of an event are recombined into a 1-jet. Because a 1-jet is a trivial
jet, it does not make sense to consider values of the jet resolution parameter
higher than $y_{\mathrm{cut,triv}}$, i.e. the jet resolution parameter has to be
restricted to the interval $0 < y_{\mathrm{cut}} \le y_{\mathrm{cut,triv}}$. By a
suitable rescaling of the reference mass $M$, one always achieves that
$y_{\mathrm{cut,triv}} = 1$.

B) Boundary condition: The reference mass $M$ has to be chosen such that,
for $y_{\mathrm{cut}} = 1$, all hadrons are recombined into a 1-jet and that,
for $y_{\mathrm{cut}} < 1$, only higher order $n$-jets, $n > 1$, are generated.

A simple solution to this condition is obtained by identifying the square of
the reference mass with the sum of the squares of all distances between the
hadrons,

    $$M^2 = \sum_{i<j} d_{ij}^2. \qquad (3)$$

This choice is not unique. For example one could weight each distance $d_{ij}$
in (3) with a factor $w_{ij} \ge 1$, but $w_{ij} = 1$ is distinguished, since it
leads to the smallest reference mass $M$.

The reference mass $M$ changes, in general, from clustering step to clustering
step. On the one hand this might cause problems in theoretical calculations,
but on the other hand there is a lot of freedom to tune the reference mass, e.g.
by means of the weights $w_{ij}$.



Finally we formulate two conditions on the distance $d_{ij}$.

Almost all commonly used jet algorithms are not Lorentz invariant.
Consequently the jet predictions of these algorithms depend on the chosen
reference frame. This causes experimental problems if the laboratory frame does
not agree with the reference frame suited to a non-Lorentz invariant jet
algorithm. Due to the strong energy imbalance between the electrons and protons
at HERA, the laboratory frame is not one of the theoretically preferred
reference frames, like the hadronic center of mass frame or the Breit frame.
Besides the experimental errors in the measurement of jets, a considerable
source of additional uncertainty is introduced at HERA through an event by
event measurement of certain kinematical observables. These kinematical
observables are needed in order to determine the transformation between the
laboratory frame and the reference frame of a non-Lorentz invariant jet
algorithm. The additional uncertainty might reduce the applicability of a
non-Lorentz invariant jet algorithm.

Lorentz invariant jet algorithms are very convenient, at least from an
experimental point of view, since they can be used in the laboratory frame. The
laboratory frame is the only natural frame to study detector acceptance effects.
Finding the proper acceptance cuts is one of the most important and most
difficult experimental tasks. But also from a theoretical point of view Lorentz
invariant jet algorithms are preferred, if one is interested in particle-like
jets and if one assumes that particle-like jets belong to the observables of
QCD. Since QCD is Lorentz invariant⁴, the determination of particle-like jets
cannot depend on the inertial reference frame where the jet algorithm is
applied. This leads us to the

C) Lorentz invariance condition: The distance $d_{ij}$ has to be independent of
the inertial reference frame.



In QCD the radiation of collinear partons is more probable than the radiation
of non-collinear partons, if one leaves aside infrared partons. Collinear
partons are recombined with the highest priority if one assigns a vanishing
distance to them. This motivates the

D) Collinearity condition: The distance between collinear jets should vanish,

    $$d_{ij} = 0, \quad \text{if } p_i^\mu = \lambda p_j^\mu. \qquad (4)$$

⁴ A short discussion of the problem, whether the Lorentz invariance might be
broken in the vacuum of gauge theories, can be found in [17].

Collinearity is understood as collinearity of 4-momenta. The collinearity
factor $\lambda$ has to be positive, because of the positivity of the energy.

Let us analyze the consequences of the clustering conditions C) and D).
Because of the Lorentz invariance condition C) the distance $d_{ij}$ is a
function of the invariants $p_i^2$, $p_j^2$, $p_i p_j$. We are therefore able to
expand

    $$\frac{d_{ij}^2}{M^2} = a_0 + \sum_{k=1}^{3} a_k r_k + \sum_{k,l=1}^{3} a_{kl} r_k r_l + \dots \qquad (5)$$

into powers of the invariants

    $$r_1 = \frac{\sqrt{p_i^2}}{M} = \frac{m_i}{M}, \quad r_2 = \frac{\sqrt{p_j^2}}{M} = \frac{m_j}{M}, \quad r_3 = \frac{\sqrt{2 p_i p_j}}{M}. \qquad (6)$$

For a moment let us assume that the jet masses $m_i, m_j$ are small compared to
the reference mass $M$. If in addition the hadrons are almost infrared
($p_i \approx 0$), or almost collinear ($p_i^\mu \approx \lambda p_j^\mu$ with a
collinearity factor $\lambda$ of the order of 1), one also has
$p_i p_j \ll M^2$. Then it is allowed to stop the expansion (5) at the
quadratic level. It turns out that the clustering condition D) determines the
distance $d_{ij}$ up to an overall normalization factor $c = a_{33}$,

    $$d_{ij}^2 = c \left( (p_i + p_j)^2 - (m_i + m_j)^2 \right), \qquad (7)$$

i.e. we are able to derive a distance from the clustering conditions C) and D),
which is unique (up to normalization) for small jet masses.
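The step from the quadratic expansion to the distance (7) can be made explicit. The following sketch assumes the dimensionless invariants $r_1 = m_i/M$, $r_2 = m_j/M$, $r_3 = \sqrt{2 p_i p_j}/M$ and a quadratic term written without a symmetry factor; with a different normalization of the invariants the identification of $c$ with a coefficient changes by a numerical factor, but the form of the result does not.

```latex
% Assumption: r_1 = m_i/M, r_2 = m_j/M, r_3 = \sqrt{2 p_i p_j}/M.
% Collinear momenta p_i^\mu = \lambda p_j^\mu give r_1 = \lambda r_2 and r_3 = \sqrt{2\lambda}\, r_2.
\begin{align*}
\frac{d_{ij}^2}{M^2}\bigg|_{p_i = \lambda p_j}
  &= a_0 + \bigl(a_1 \lambda + a_2 + a_3 \sqrt{2\lambda}\bigr)\, r_2 \\
  &\quad + \bigl(a_{11}\lambda^2 + a_{22} + (a_{12} + a_{21} + 2 a_{33})\,\lambda
     + (a_{13} + a_{31})\sqrt{2}\,\lambda^{3/2} + (a_{23} + a_{32})\sqrt{2\lambda}\bigr)\, r_2^2 .
\end{align*}
% Condition D) requires this to vanish for all \lambda > 0 and all r_2 \ge 0, hence
\begin{align*}
a_0 = a_k = 0, \quad a_{11} = a_{22} = 0, \quad
a_{13} + a_{31} = a_{23} + a_{32} = 0, \quad a_{12} + a_{21} = -2 a_{33},
\end{align*}
% and the quadratic expansion collapses to
\begin{align*}
\frac{d_{ij}^2}{M^2} = a_{33}\,\bigl(r_3^2 - 2 r_1 r_2\bigr)
  = a_{33}\,\frac{2 p_i p_j - 2 m_i m_j}{M^2}
  = a_{33}\,\frac{(p_i + p_j)^2 - (m_i + m_j)^2}{M^2} .
\end{align*}
```

The last equality uses $(p_i + p_j)^2 = m_i^2 + m_j^2 + 2 p_i p_j$, so the squared masses cancel against those in $(m_i + m_j)^2$.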

The normalization factor $c$ has to be positive, otherwise the distance
$d_{ij}$ is not a real number (to see this, use $p_i p_j \ge m_i m_j$). If the
reference mass is identified with (3), the normalization factor $c$ does not
influence the inequality (1) of the clustering algorithm. In this case one can
set $c = 1$.

In the limit of massless jets the distance (7) becomes proportional to the
distance of the JADE algorithm,

    $$d_{ij}^2 \to c\, 2 E_i E_j (1 - \cos\theta_{ij}), \quad \text{as } m_i, m_j \to 0.$$

The distance $d_{ij}$ would not become the JADE distance in the limit of small
jet masses if the invariants in (6) were defined without the square root.

During a clustering procedure the mass of a particle-like jet increases. In
practice one arrives at jet masses which are large compared to the masses of
the individual hadrons within a jet and not negligible compared to the
reference mass. Beyond the first clustering step the JADE algorithm violates
Lorentz invariance more and more, because its distance
$2 E_i E_j (1 - \cos\theta_{ij})$ is Lorentz invariant only for massless jets.
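The difference between the two distances under a change of frame can be checked numerically. The sketch below assumes momenta as tuples `(E, px, py, pz)`; the function names and the toy momenta are hypothetical. It evaluates the distance (7) and the JADE distance before and after a boost along the z axis.

```python
import math

def dot(p, q):
    """Minkowski product with metric (+, -, -, -)."""
    return p[0] * q[0] - p[1] * q[1] - p[2] * q[2] - p[3] * q[3]

def mass(p):
    return math.sqrt(max(dot(p, p), 0.0))

def d2_invariant(p, q, c=1.0):
    """Distance (7): c[(p+q)^2 - (m_p+m_q)^2] = 2c (p.q - m_p m_q)."""
    return 2.0 * c * (dot(p, q) - mass(p) * mass(q))

def d2_jade(p, q):
    """JADE distance 2 E_i E_j (1 - cos theta_ij)."""
    norm_p = math.sqrt(p[1]**2 + p[2]**2 + p[3]**2)
    norm_q = math.sqrt(q[1]**2 + q[2]**2 + q[3]**2)
    cos_theta = (p[1] * q[1] + p[2] * q[2] + p[3] * q[3]) / (norm_p * norm_q)
    return 2.0 * p[0] * q[0] * (1.0 - cos_theta)

def boost_z(p, beta):
    """Lorentz boost along the z axis with velocity beta (in units of c)."""
    gamma = 1.0 / math.sqrt(1.0 - beta**2)
    return (gamma * (p[0] - beta * p[3]), p[1], p[2], gamma * (p[3] - beta * p[0]))

# Toy momenta (made-up numbers): two massive jets with masses 1 and 2.
p = (math.sqrt(26.0), 3.0, 0.0, 4.0)
q = (3.0, 0.0, 2.0, -1.0)
p_b, q_b = boost_z(p, 0.6), boost_z(q, 0.6)

print("distance (7):", d2_invariant(p, q), d2_invariant(p_b, q_b))
print("JADE        :", d2_jade(p, q), d2_jade(p_b, q_b))
```

For massless momenta both functions agree, which is the limit stated above; for the massive toy momenta only the first one is unchanged by the boost.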

The distance of the E-clustering algorithm, $d_{ij}^2 = (p_i + p_j)^2$, is
Lorentz invariant, but it does not fulfil the collinearity condition D) if the
jets are massive.

It should be noticed that (7) does not define a distance in the classical
sense. If it were a classical distance, the triangle inequality
$d_{ij} + d_{jk} \ge d_{ik}$ would hold, but the triangle inequality is violated
because of the collinearity condition D). To see this, consider e.g. three
hadrons with momenta $p_1, p_2, p_3$ and assume that two of them are collinear:
$p_3 = \lambda p_2$; then the sum of two of the distances,
$d_{13} + d_{23} = \sqrt{\lambda}\, d_{12}$, is smaller than the third distance,
$d_{12}$, for all $\lambda < 1$.
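A small numerical example illustrates the violated triangle inequality. It is a sketch under the assumption $c = 1$, with momenta as tuples `(E, px, py, pz)` and made-up toy values; the helper names are hypothetical.

```python
import math

def dot(p, q):
    """Minkowski product with metric (+, -, -, -)."""
    return p[0] * q[0] - p[1] * q[1] - p[2] * q[2] - p[3] * q[3]

def mass(p):
    return math.sqrt(max(dot(p, p), 0.0))

def dist(p, q):
    """Square root of the distance (7) with c = 1."""
    d2 = 2.0 * (dot(p, q) - mass(p) * mass(q))
    return math.sqrt(max(d2, 0.0))  # guard against tiny negative rounding errors

lam = 0.25                              # collinearity factor, lambda < 1
p1 = (5.0, 3.0, 0.0, 4.0)               # massless
p2 = (3.0, 0.0, 2.0, -1.0)              # mass 2
p3 = tuple(lam * x for x in p2)         # collinear to p2

d12, d13, d23 = dist(p1, p2), dist(p1, p3), dist(p2, p3)
print(d13 + d23, "<", d12)
```

Here `d23` vanishes by the collinearity condition and `d13` equals `sqrt(lam) * d12`, so the sum of two sides is shorter than the third.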

Combining (3) and (7), the square of the reference mass becomes (with $c = 1$)
formally similar to the distance (7),

    $$M^2 = W^2 - (m_1 + m_2 + \dots + m_N)^2, \qquad (8)$$

where $W^2 = (p_1 + \dots + p_N)^2$ is the square of the mass of the hadronic
final state. We see that the reference mass $M$ can be chosen smaller than the
mass of the hadronic final state $W$ without contradicting the boundary
condition B).
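For completeness, the intermediate algebra behind (8) can be written out. The sketch only uses (3), (7) with $c = 1$, and $p_i^2 = m_i^2$:

```latex
\begin{align*}
M^2 &= \sum_{i<j} d_{ij}^2
     = \sum_{i<j} \bigl[(p_i + p_j)^2 - (m_i + m_j)^2\bigr]
     = 2 \sum_{i<j} \bigl(p_i p_j - m_i m_j\bigr) \\
    &= \Bigl[\Bigl(\sum_i p_i\Bigr)^2 - \sum_i m_i^2\Bigr]
     - \Bigl[\Bigl(\sum_i m_i\Bigr)^2 - \sum_i m_i^2\Bigr]
     = W^2 - (m_1 + \dots + m_N)^2 .
\end{align*}
```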

If the jet masses $m_i, m_j$ are large, higher order corrections in (5) have to
be taken into account and the distance $d_{ij}$ is no longer unique, because the
conditions C) and D) are not violated if the right hand side of (7) is
multiplied by any positive function $f(m_i, m_j, p_i p_j)$.

The distance (7) with $c = 1$, the reference mass (3) and the recombination
prescription (2) define a new jet clustering algorithm, which is the simplest
algorithm allowed in this setting. This jet algorithm can be applied to all
areas of jet physics, as long as the jet masses are small compared to the
reference mass. It is suited for the investigation of hadronization effects
around remnants and the determination of particle-like parton jets, as is
demonstrated in the next section in the case of deep inelastic electron-proton
scattering. Theoretical calculations based on this jet algorithm are well
defined, since collinear and infrared singularities are identifiable. For
certain theoretical problems, e.g. the exponentiation problem, it might be
advisable to modify the distance (7) by higher order mass corrections and/or to
use a different reference mass $M$.
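Putting the three ingredients together, the algorithm of this section might be sketched as follows. This is an illustration, not the author's implementation: the names are hypothetical, momenta are tuples `(E, px, py, pz)`, the reference mass is recomputed after every recombination, and `<=` is used in the comparison (an assumption) so that the last remaining pair, for which the distance squared equals $M^2$, is merged at $y_{\mathrm{cut}} = 1$ as the boundary condition B) demands.

```python
import math
from itertools import combinations

def dot(p, q):
    """Minkowski product with metric (+, -, -, -)."""
    return p[0] * q[0] - p[1] * q[1] - p[2] * q[2] - p[3] * q[3]

def mass(p):
    return math.sqrt(max(dot(p, p), 0.0))

def add(p, q):
    """Recombination prescription (2): plain 4-vector sum."""
    return tuple(a + b for a, b in zip(p, q))

def dist2(p, q):
    """Distance (7) with c = 1."""
    return 2.0 * (dot(p, q) - mass(p) * mass(q))

def ref_mass2(jets):
    """Reference mass squared (3): sum of all pairwise squared distances."""
    return sum(dist2(p, q) for p, q in combinations(jets, 2))

def cluster_invariant(momenta, y_cut):
    jets = list(momenta)
    while len(jets) > 1:
        m2 = ref_mass2(jets)  # changes from clustering step to clustering step
        i, j = min(combinations(range(len(jets)), 2),
                   key=lambda ij: dist2(jets[ij[0]], jets[ij[1]]))
        # "<=" (assumption) lets y_cut = 1 merge the final pair, cf. condition B).
        if not dist2(jets[i], jets[j]) <= y_cut * m2:
            break
        merged = add(jets[i], jets[j])
        jets = [p for k, p in enumerate(jets) if k not in (i, j)] + [merged]
    return jets

# Toy event: three massless "hadrons" (made-up numbers).
hadrons = [(10.0, 0.0, 0.0, 10.0), (5.0, 3.0, 0.0, 4.0), (15.0, 0.0, 0.0, -15.0)]
two_jets = cluster_invariant(hadrons, y_cut=0.2)
one_jet = cluster_invariant(hadrons, y_cut=1.0)
print(len(two_jets), len(one_jet))
```

Because every quantity is built from Minkowski products, the same jets are found in any inertial frame, up to rounding.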

3 Hadronization effects 

In this section we present a study of hadronization effects for the JADE
algorithm, called jet algorithm 1, and its simplest generalization to massive
jets, as presented in the last section, called jet algorithm 2. Jet algorithm 1
is not Lorentz invariant and needs the specification of an inertial system.
We consider it in the hadronic center of mass frame. Using the DJANGO
Monte Carlo we generate 5000 deep inelastic scattering events of 26.7 GeV
electrons and 820 GeV protons. DJANGO is able to take into account corrections
from the bremsstrahlung of the electron. Let
$p_{\mathrm{prot}}, p_e, p_{e'}, p_\gamma$ be the momenta of the incoming
proton, the incoming electron, the scattered electron and the bremsstrahlung
photon, respectively, and let us define $Q^2 = -q^2$ and
$Q_{\mathrm{had}}^2 = -q_{\mathrm{had}}^2$, where $q = p_e - p_{e'}$ and
$q_{\mathrm{had}} = q - p_\gamma$. The events are generated with
$Q^2 > 80\ \mathrm{GeV}^2$. This choice restricts
$x_{Bj} = Q_{\mathrm{had}}^2 / (2 p_{\mathrm{prot}} q_{\mathrm{had}})$ to
$x_{Bj} > 10^{-3}$.
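The quoted lower bound on $x_{Bj}$ can be checked with a short estimate. The sketch assumes negligible particle masses, no photon radiation (so that $Q_{\mathrm{had}}^2 = Q^2$), and the standard relation $x = Q^2 / (s\,y)$ with inelasticity $y \le 1$; the variable names are illustrative.

```python
# Beam energies and Q^2 cut as quoted in the text (GeV, GeV^2).
E_ELECTRON = 26.7
E_PROTON = 820.0
Q2_MIN = 80.0

# Squared center of mass energy for head-on beams, masses neglected.
s = 4.0 * E_ELECTRON * E_PROTON

# x = Q^2 / (s * y) and y <= 1 give the smallest reachable x at the Q^2 cut.
x_min = Q2_MIN / s

print(f"s = {s:.0f} GeV^2, x_min = {x_min:.2e}")
```

The result is close to $10^{-3}$, in line with the restriction stated above.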

LEPTO offers different options to study various aspects of electron-proton
scattering: a matrix element (ME) option to generate 1+1 or 2+1 parton jets⁵
according to leading order QCD matrix elements of order $\alpha_s$, a parton
shower (PS) option, and a fragmentation option for the generation of hadrons
out of partons based on the LUND string fragmentation model. Since these
options can be combined freely, partons at two different levels can be
extracted from LEPTO: a ME-parton level and a PS-parton level. In the following
we identify the parton level with the ME-parton level. In LEPTO the generation
of the ME-partons is based on the JADE algorithm. We generated the ME-partons
with the jet resolution parameter $y_{\mathrm{cut}}$ = PARL(8) = 0.008. The
parton jets and hadron jets are determined with the higher value
$y_{\mathrm{cut}} = 0.02$, in order to guarantee that the available phase space
for the 2+1 jets of jet algorithm 2 at $y_{\mathrm{cut}} = 0.02$ is (for the
most part) included in the phase space for the 2+1 jets of jet algorithm 1 at
$y_{\mathrm{cut}} = 0.008$. (According to the PROJET Monte Carlo, contributions
from 3+1 jets are small at $y_{\mathrm{cut}} = 0.02$.) We order the parton jets
and the hadron jets according to their distance to the proton,
$d_{\mathrm{jet,prot}}$. The jet which is closest to the proton direction is
considered as the remnant jet; the next jet is called jet 1.

First we concentrate on the hadron level and compare the transverse momentum
($p_T$) distributions of jet 1 for the jet algorithms 1 and 2 (grey
distributions in figures 1a and 2a; the hatched part shows hadronization
effects, see later). The $p_T$ distributions differ strongly. Both jet
algorithms show a maximum around $p_T = 10$ GeV, but jet algorithm 2 finds many
more low-$p_T$ jets, i.e. jet algorithm 2 is much more sensitive in the
low-$p_T$ region than jet algorithm 1. The different predictions for the
low-$p_T$ behavior are caused by the mass of the jets, although the mean mass
of the hadron jets is only of the order of 10 % of the reference mass $M$.

The low-$p_T$ behavior of jet 1 is dominated by hadronization effects. This
can be read off from figures 1b and 2b, which show the scatter plots of the
transverse momentum $p_T$ of jet 1 (hadron level (HL) versus parton level
(PL)) for jet algorithm 1 and jet algorithm 2, respectively. From figures 1b
and 2b it follows moreover that the major part of the hadronization effects is
characterized by a transverse momentum $p_T(\mathrm{HL}) < 5$ GeV.

⁵ The proton remnant is counted as a jet.

The transverse momentum of the jets is often used to control hadronization
effects. We suggest another method. Consider the square of the distance
between the proton and jet 1, scaled by the square of the reference mass,

    $$z^* = \frac{d_{\mathrm{jet1,prot}}^2}{M^2}.$$

Figures 1c and 2c show the influence of the cut

    $$z^* > 0.3 \qquad (9)$$

on the scatter plots 1b and 2b. As can be seen, a lot of the off-diagonal
events are removed for both jet algorithms, i.e. the cut (9) specifies a region
in the phase space where parton jets can be considered as particle-like, at
least with respect to the transverse momentum of jet 1. A $p_T$-cut acts
"horizontally" in figures 1b and 2b, a $z^*$-cut acts almost "diagonally".
Contrary to a $p_T$-cut, which depends on the reference frame, a $z^*$-cut is
Lorentz invariant and can always be applied in the laboratory frame. The
hatched distributions in figures 1a and 2a show the transverse momentum of
jet 1 for those events which do not fulfil the $z^*$-cut (9). By a measurement
of the $p_T$-distribution of jet 1, the increase below $p_T = 6$ GeV, predicted
by jet algorithm 2, might be observable at HERA. Since the increase is caused
by hadronization effects, jet algorithm 2 could be useful to test hadronization
models experimentally.
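As an illustration of how such a cut might be evaluated event by event, the sketch below computes $z^*$ from the proton momentum, the momentum of jet 1 and the reference mass squared. The function names and the toy numbers are hypothetical; momenta are tuples `(E, px, py, pz)` and the distance (7) is used with $c = 1$.

```python
import math

def dot(p, q):
    """Minkowski product with metric (+, -, -, -)."""
    return p[0] * q[0] - p[1] * q[1] - p[2] * q[2] - p[3] * q[3]

def mass(p):
    return math.sqrt(max(dot(p, p), 0.0))

def dist2(p, q):
    """Distance (7) with c = 1."""
    return 2.0 * (dot(p, q) - mass(p) * mass(q))

def z_star(p_prot, p_jet1, ref_mass2):
    """Scaled squared distance between the incoming proton and jet 1."""
    return dist2(p_prot, p_jet1) / ref_mass2

def passes_z_cut(p_prot, p_jet1, ref_mass2, z_min=0.3):
    return z_star(p_prot, p_jet1, ref_mass2) > z_min

# Toy numbers only: a massless "proton" along +z and a massless jet.
p_prot = (5.0, 0.0, 0.0, 5.0)
p_jet1 = (13.0, 5.0, 12.0, 0.0)
print(z_star(p_prot, p_jet1, ref_mass2=260.0))
```

Since only Minkowski products enter, the momenta can be taken directly in the laboratory frame.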

The scaled distance between jet 1 and the remnant,
$d_{\mathrm{jet1,remnant}}^2 / M^2$, is less suited in practice to identify
hadronization effects, since most of the remnant escapes undetected in the
beampipe. While the momentum of the proton is known, the momentum of the
remnant can only be reconstructed from the missing momentum in the detector.

Finally we show that a $z^*$-cut is also useful to control hadronization
effects for 2+1 jets. 2+1 jets are important, since they can be used to measure
the strong coupling constant $\alpha_s$. Let us consider the variable

    $$\eta = \frac{Q_{\mathrm{had}}^2}{Q_{\mathrm{had}}^2 + (p_{\mathrm{jet1}} + p_{\mathrm{jet2}})^2}.$$

Figures 3 and 4 show the $\eta$-scatter plots (hadron level versus parton
level) for the 2+1 hadron jets of jet algorithms 1 and 2, respectively. In
figures 3a and 4a no hadronization cuts are applied. The correlations are very
poor, indicating strong hadronization effects for both jet algorithms. After
applying the $z^*$-cut (9) the correlations are considerably improved (figures
3b and 4b). Since strong $z^*$-cuts lower the statistics, one has to look for a
compromise between the strength of the hadronization effects and the statistics
of the data. The "best" $z^*$-cut is the cut at which the uncertainty for the
reconstruction of the parton level from the hadron level is comparable to the
uncertainty for the reconstruction of the hadron level from the detector level.
Another important variable to characterize 2+1 jets is

    $$z_p = \frac{p_{\mathrm{prot}}\, p_{\mathrm{jet1}}}{p_{\mathrm{prot}}\, q_{\mathrm{had}}}.$$

Since $z^* \approx z_p$ in the limit of small masses and small $x_{Bj}$, one can
consider $z^*$ as a generalization of $z_p$ to massive jets. The mean mass of
hadron jets is of the order of 10 % of the reference mass $M$ and therefore
$z_p$ does not differ very much from $z^*$ in practice. This means that a
measurement of $z_p$ cannot be performed below the $z^*$-cut.
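The relation between $z^*$ and $z_p$ can be made explicit with a short calculation. The sketch assumes the distance (7) with $c = 1$, the reference mass in the form (8), and that the total momentum of the hadronic final state equals $p_{\mathrm{prot}} + q_{\mathrm{had}}$; all masses are neglected in the last steps.

```latex
\begin{align*}
z^* &= \frac{d_{\mathrm{jet1,prot}}^2}{M^2}
     = \frac{2\,\bigl(p_{\mathrm{prot}}\, p_{\mathrm{jet1}} - m_{\mathrm{prot}}\, m_{\mathrm{jet1}}\bigr)}
            {W^2 - (m_1 + \dots + m_N)^2}
     \;\longrightarrow\; \frac{2\, p_{\mathrm{prot}}\, p_{\mathrm{jet1}}}{W^2}, \\
W^2 &= (p_{\mathrm{prot}} + q_{\mathrm{had}})^2
     = m_{\mathrm{prot}}^2 + 2\, p_{\mathrm{prot}}\, q_{\mathrm{had}} - Q_{\mathrm{had}}^2
     \approx 2\, p_{\mathrm{prot}}\, q_{\mathrm{had}}\,(1 - x_{Bj}), \\
z^* &\approx \frac{z_p}{1 - x_{Bj}} .
\end{align*}
```

Under these assumptions the two variables differ only by the factor $1/(1 - x_{Bj})$, which is close to 1 for small $x_{Bj}$.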

In summary, the hadronization effects of the JADE algorithm and its simplest
Lorentz invariant generalization to massive jets differ strongly around the
direction of the incoming proton. By adjusting the $z^*$-cut properly the
differences almost disappear. The hadronization effects from the proton remnant
are controllable without any special treatment of the proton remnant during the
clustering procedure. The Lorentz invariant generalization of the JADE
algorithm, derived in section 2, is sensitive to hadronization effects and
might be useful in jet physics to test hadronization models experimentally.

Figure 1: Jet algorithm 1: a) $p_T$ distribution (in GeV) of jet 1 (hadron
level) in the hadronic center of mass frame. The hatched part shows
hadronization effects with $z^* < 0.3$. b) Scatter plot of the transverse
momentum $p_T$ (in GeV) of jet 1 in the hadronic center of mass frame: without
a $z^*$-cut. c) As b) but with the cut $z^* > 0.3$.

Figure 2: Jet algorithm 2: a) $p_T$ distribution (in GeV) of jet 1 (hadron
level) in the laboratory frame. The hatched part shows hadronization effects
with $z^* < 0.3$. b) Scatter plot of the transverse momentum $p_T$ (in GeV) of
jet 1 in the laboratory frame: without a $z^*$-cut. c) As b) but with the cut
$z^* > 0.3$.

Figure 3: Jet algorithm 1: scatter plot of $\eta$ for 2+1 hadron jets: a)
without $z^*$-cut, b) with the cut $z^* > 0.3$.

Figure 4: Jet algorithm 2: scatter plot of $\eta$ for 2+1 hadron jets: a)
without $z^*$-cut, b) with the cut $z^* > 0.3$.



